A Multi-level Framework for Video Shot Structuring
نویسندگان
چکیده
Video shots provide the most basic meaningful segments for video analysis and understanding. In this paper, we present a detection and classification framework for the video shot segmentation in a coarse-to-fine fashion. The initial transitions are detected from a sub-sampled video space. These coarse segments are later refined in the original video space with the technique of illumination artifacts removal and transition finalization. The transition type (abrupt or gradual) are finally determined by exploiting the histogram intersection plot. The proposed method has been tested on a large amount of videos, which contain a variety of types of shot transitions. Accurate and competitive results have been obtained.
منابع مشابه
Multi-Label Zero-Shot Human Action Recognition via Joint Latent Embedding
Human action recognition refers to automatic recognizing human actions from a video clip, which is one of the most challenging tasks in computer vision. Due to the fact that annotating video data is laborious and timeconsuming, most of the existing works in human action recognition are limited to a number of small scale benchmark datasets where there are a small number of video clips associated...
متن کاملNon-sequential video structuring based on video object linking: an efficient tool for video browsing and indexing
An efficient system for unsupervised structuring of stereoscopic sequences is presented in this paper, which generates links between similar VOPs of different shots. Particularly after shot cut detection, for each shot, a fast, unsupervised VOP detection and tracking algorithm is applied. Then for each of the foreground VOPs of a frame, a feature vector is constructed using low level features o...
متن کاملAn unified framework for shot boundary detection via active learning
Video shot boundary detection is an important step in many video processing applications. We observe that video shot boundary is a multi-resolution edge phenomenon in the feature space. In this paper, we expanded our previous temporal multi-resolution analysis (TMRA) work by introducing the new feature vector based on motion. Further we employ the support vector machine (SVM) to refine the clas...
متن کاملTemporal Multi-Resolution Framework for Shot Boundary Detection and Keyframe Extraction
Video shot boundary detection and keyframe extraction is an important step in many video-processing applications. We observe that video shot boundary is a multi-resolution edge phenomenon in the feature space. In this experiment, we expanded our previous temporal multi-resolution analysis (TMRA) work by introducing the new feature vector based on motion, incorporating functions to detect flash ...
متن کاملSemantic Shot Classification in Sports Video
In this paper, we present a unified framework for semantic shot classification in sports videos. Unlike previous approaches, which focus on clustering by aggregating shots with similar low-level features, the proposed scheme makes use of domain knowledge of specific sport to perform a top-down video shot classification, including identification of video shot classes for each sport, and supervis...
متن کامل